Supporting Reference 2 



Supplemental Material can be found at: 

htip://www-.jbc.org/content/suppl,^2002/05/20/277.21.18245.DC1.htmj 



Accelerated Publication 



The Journal of Biological Chemistry 
Vol. 277, No. 21, Issue of May 24, pp. 18245-18248, 2002 
© 2002 by The American Society for Biochemistry and Molecular Biology, Inc. 

Printed in U.S.A. 



Engineering-enhanced Protein 
Secretory Expression in Yeast 
with Application to Insulin*!^ 

Received for publication, March 7, 2002, 
and in revised form, March 27, 2002 
Published, JBC Papers in Press, March 28, 2002, 
DOI 10.1074/jbc.C200137200 

Thomas Kjeldsen$, Svend Ludvigsen, 

Ivan Diers, Per Balschmidt, Anders R. S0rensen, 

and Niels C. Kaarsholm 

From the Novo Nordisk AJS, Novo Alle, 
DK-2880 Bagsvaerd, Denmark 

Adaptation to efficient heterologous expression is a 
prerequisite for recombinant proteins to fulfill their 
clinical and biotechnological potential. We describe a 
rational strategy to optimize the secretion efficiency in 
yeast of an insulin precursor by structure-based engi- 
neering of the folding stability. The yield of a fast- acting 
insulin analogue (Asp^^^) expressed in yeast was en- 
hanced 5-fold by engineering a specific interaction be- 
tween an aromatic amino acid in the connecting peptide 
and a phenol binding site in the hydrophobic core of the 
molecule. This insulin precursor is characterized by sig- 
nificantly enhanced folding stability. The improved 
folding properties enhanced the secretion efficiency of 
the insulin precursor from 10 to 50%. The precursor 
remains fully in vitro convertible to mature fast-acting 
insulin. 



Administration of the two-chain 51-amino acid peptide hor- 
mone insulin is life sustaining for many of the 150 million 
diabetic patients in the world. Classical pancreatic extraction 
results in 10-15 mg of insulin per pancreas and the number of 
porcine or bovine pancreases to produce current requirements 
of insulin (in excess of 7 metric tons) is simply not available. 
Consequently, a substantial fraction of insulin is today pro- 
duced by eukaryotic secretory expression in the yeast Saccha- 
romyces cerevisiae. Recently, second generation fast-acting ge- 
netically engineered insulin analogues have been engineered 
for improved diabetes therapy. This is exemplified by the clin- 
ically relevant fast-acting insulin, which features an amino 
acid substitution in position 28 of the B-chain from proline to 
aspartic acid and is used for treatment of diabetes mellitus (1). 

However, many biochemical and structural properties of in- 
sulin and proinsulin have evolved in response to differential 
requirements of biosynthesis, processing, transport, and stor- 
age in the j8-cells of Langerhans islets (2, 3). Importantly, 
insulin and insulin analogues are readily adapted for expres- 
sion in yeast as single-chain proinsulin-like precursors lacking 
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the connecting peptide in the following configuration: amino 
acid residues 1-29 of the B-chain connected to the A-chain by a 
short removable mini-C-peptide and fused to the yeast prepro-a 
factor through a single dibasic cleavage site (secretory expres- 
sion of insulin in yeast and in vitro enzymatic conversion of the 
precursor to insulin has recently been reviewed (4, 5)). Removal 
of the prepro-« factor by endoproteolytic cleavage by the Kex2 
endoprotease in a late Golgi compartment generates a single- 
chain insulin precursor with self-association properties and 
structure essentially similar to that of human two-chain insu- 
lin (6, 7). 

Proteins exit the lumen of the endoplasmic reticulum (ER)^ 
after folding, and frequently this is a rate-limiting step in 
secretion (8-10). Herein we describe an improvement in eu- 
karyotic secretion efficiency with application to insulin, using a 
strategy of structure-based rational engineering under the as- 
sumption that a more stably folded precursor molecule is ac- 
companied by more efficient transport through the secretory 
pathway. The structure of the Asp^^^ insulin precursor, termed 
insulin aspart, as a hexamer and its binding sites for small 
molecule ligands (11) are used to develop novel C-peptides 
tailored to provide structural stability to the precursor mole- 
cule. The resulting enhancement of the expression yield corre- 
lated with increased structural stability and is rationalized in 
terms of the structure of the novel precursor molecule. 

EXPERIMENTAL PROCEDURES 

General Molecular Biology Techniques — Materials, strains, media, 
and general molecular biology techniques and yeast expression were as 
described previously unless otherwise mentioned (12-14). Expression of 
insulin precursors in S. cerevisiae was performed in the following con- 
figuration: a-factor leader-KR-spacer-insulin precursor, where KR is 
the Kex2 dibasic endoprotease processing site. To optimize processing of 
the fusion protein by the S. cerevisiae Kex2 endoprotease, a spacer 
peptide, EEAEAEAPK, was present between the leader and the insulin 
precursor (13). The mature insulin precursor was secreted as a single- 
chain N-terminally extended proinsulin-like polypeptide with a short 
synthetic peptide connecting Lys^^^ and Gly^^. After purification of the 
insulin precursor and proteolytic removal of the N-terminal extension 
and connecting peptide, the amino acid Thr^^° can be added to Lys^^^ by 
trypsin-mediated transpeptidation to generate insulin (15). 

Quantification and Purification of Insulin Precursors — Quantifica- 
tion of the insulin aspart precursor yield in the culture supernatants 
was performed as previously described (12, 16). Pulse-chase analysis 
employed 2.5-min metabolic labeling with P^S] cysteine using cultures 
with anAgoo of — 10 as previously described (12). The insulin precursor 
was captured from acidified and clarified culture supernatant by ad- 
sorption to a cation exchange column. The eluted protein was further 
purified by preparative reverse phase high pressure liquid chromatog- 
raphy on a C 18 silica column with an ethanol/water gradient in phos- 
phate buffer at pH 3. The main protein peak was collected and finally 
lyophilized after desalination. 

Guanidine Hydrochloride (GdnHCl) -induced Protein Denaturation — 
The GdnHCl-induced unfolding was measured by far-UV CD using a 
Jasco J-715 spectropolarimeter at pH 8.0 as described (17). The unfold- 
ing curves display the relative change in CD at 224 nm (AeQt,g^j — 
^^base)/~^^base ^ fuuctiou of deuaturaut concentration, where Ae^^g^ 
is the observed CD in the (horizontal) pre-transition zone. For each 
precursor, C^.^ is the concentration of GdnHCl corresponding to un- 
folding of one-half of the population. 

NMR Spectroscopy — NMR spectra were recorded at 600 and 800 



^ The abbreviations used are: ER, endoplasmic reticulum; GdnHCl, 
guanidine hydrochloride; NOESY, two-dimensional nuclear Over- 
hauser enhanced spectroscopy; NOE, nuclear Overhauser enhance- 
ment. 



This paper is available on line at http://www.jbc.org 



18245 



18246 



Engineering-enhanced Eukaryotic Secretion 





Fig. 1. The hexamer structure (a) of insulin aspart precursor^^ 
(Protein Data Bank code IZEI) determined by x-ray crystallography 
(11) has two unique binding sites (shown in blue) for m-cresol in the 
third dimer of insulin aspart precursor^^ hexamer, in addition to the 
classical insulin m-cresol/phenol binding sites (shown in red). The two 
extra m-cresol molecules are inserted between the monomers in almost 
symmetrically equivalent binding sites (6), which are composed of res- 
idues Tyr^26 y^iBi2 f^jjj Qj^g monomer and Leu^^^ Tyr^^^ Cys^^^ 
and Phe^^* from the second monomer of the dimer. For comparison of 
the insulin aspart precursoH^^ dimer structure with the classical in- 
sulin dimer structure, see panel c). In solution, at monomeric conditions 
without /n-cresol present, the tryptophan side chain in the engineered 
mini-C-peptide binds to the wi-cresol binding site (in a slightly shifted 
position), thus providing an anchor point for the elongated Al-helix (e). 
This aromatic amino acid binding pocket in the insulin aspart precur- 
gQj.EWK -g comparable with the m-cresol binding pocket found in the 
crystal state of the insulin aspart precursor^^^ {d). 



MHz using a Varian Inova. Procedures for sample preparation at pH 8, 
spectral recording, and assignment as well as methods for determina- 
tion of three-dimensional structures have been described recently (Ref. 
18, and references cited therein). 

RESULTS AND DISCUSSION 

Engineering the Insulin Aspart Precursor to High Yield Yeast 
Secretory Expression — Binding of phenol to the insuHn hex- 
amer causes well described classical allosteric conformational 

changes, i.e. the Tg —> T3R3 Rg transformations (6, 19, 20). 
The Kg hexameric crystal structure of the insulin aspart pre- 
cursor^^^^ (for simplicity the sequence of the mini-C-peptide is 
indicated in superscript) (Protein Data Bank code IZEI) has a 
number of unique features compared with the classical Rg 
insulin hexamer (11). The monomers, which comprise the hex- 
amer, are composed of traditional structural elements common 
to the R-state insulin, i.e. an a-helix (B1-B19), a turn (B20- 
B23), one-half anti-parallel j3-sheet (B24-B28), and a loop 
(B29-C1) (Fig. 1). The A-chain is composed of a slightly ex- 
tended Al a-helix (C2-C3, A1-A9), a loop (A10-A13), and 
finally the A2 a-helix (A14-A20). Besides the classical phenol 
or m-cresol binding sites, one of the three dimers has two 
additional ?n-cresol molecules located in the monomer-mono- 
mer interface next to Tyr^^® (Fig. 1). These two binding sites 
are essentially identical and composed by the hydrophobic res- 
idues Tyr^^^ and VaP^^ from one monomer and Leu^^^, Tyr^^^, 
QygBi9^ and Phe^^^ of the other monomer molecule (Fig. 1). 

It has been shown that folding stability is important for 
secretion efficiency (5, 8-10). We hypothesized that the side 
chain of an aromatic amino acid localized in the mini-C-peptide 



Table I 

Yeast expression yield and folding stability of insulin precursors 
The expression yield was determined as described under "Experimen- 
tal Procedures" and normalized for comparison to the yield of the 
insulin aspart precursoH^^. Folding stability of the precursors is de- 
scribed as C^id, the concentration of GdnHCl corresponding to unfold- 
ing of one-hgilf the population. C^^^ is derived from Fig. 3. ND, not 
determined. The insulin precursor comprising the first 29 amino acids 
of the insulin B-chain joined to the 21 amino acids of the insulin A-chain 
either by a C-peptide or directly as indicated and expressed in S. 
cerevisiae as described under "Experimental Procedures." 



Insulin precursor 
amino acid substitution 



C-peptide Relative yield C^id 









M 


AspB28 


AAK 


1.0 


5.2 


Asp^^« 




1.2 


5.6 


AspB2« 


EWK 


4.9 


7.2 


Asp^^^ 


EYK 


3.8 


ND 


Asp^2« 


EFK 


3.5 


ND 


AspB2« 


LWK 


3.6 


6.4 


GluBl°, AspB2^ j^-gAS^ Ql^^A14 




5.3 


7.0 


Glu^^°, Asp^^^ His^^ Glu^^^ 


EWK 


7.6 


>7.9 



will mimic the interactions provided by the ?n-cresol ligand 

shown in Fig. 1. Importantly, the interaction of this hydropho- 
bic residue with the hydrophobic core of the molecule would be 
energetically favorable and presumably enhance the folding 
stability of the precursor. The precursor can be converted into 
the mature two-chain insulin aspart by in vitro enzymatic 
removal of the C-peptide, which restores full biological poten- 
tial of the molecule. 

Introduction of an aromatic amino acid into the mini-C- 
peptide, e.g. EWK, did indeed increase the expression yield of 
the insulin aspart precursor (Table I). Tryptophan had the 
greatest positive influence (6-fold) on the expression yield, 
whereas phenylalanine and tyrosine increase the expression 
yield -^3. 5-fold (Table I). Mini-C-peptides with various amino 
acids in combination with the aromatic amino acid, e.g. LWK, 
enhanced the expression yield of the insulin aspart precursor 
(Table I). Interestingly, the position of the aromatic amino acid 
in the mini-C-peptide influenced the expression yield of the 
insulin aspart precursor. When the aromatic amino acid is in 
an N-terminal position to the lysine in the mini-C-peptide the 
secretion efficiency of the insulin aspart precursor is increased. 
Exchanging the sequence of the mini-C-peptide to WEK or 
shortening the mini-C-peptide to WK did not significantly in- 
crease the expression yield of the insulin aspart precursor. The 
significance of the position of tryptophan indicates that a spe- 
cific molecular interaction mediates the increase in the secre- 
tion efficiency of the insulin aspart precursor. 

The effect of an aromatic acid in the mini-C-peptide (EWK) 
on the secretion of the insulin aspart precursor expressed in 
yeast was further investigated by pulse-chase analysis with a 
2.5-min pulse with P^S]cysteine followed by 30 min of chase. 
The quantity of insulin aspart precursor with the EWK mini- 
C-peptide synthesized after the P^S] cysteine pulse is substan- 
tially larger than the quantity of precursor synthesized with 
the AAK mini-C-peptide (only the insulin aspart precursor is 
labeled) (Fig. 2). Furthermore, the quantity of the insulin as- 
part precursor^^^ secreted to the culture supernatant after 30 
min of chase was six times higher than with the AAK mini-C- 
peptide. Interestingly, a similar quantity of synthesized insulin 
aspart precursor was retained intracellularly in the vacuole 
independent of the mini-C-peptide. However, the insulin aspart 
precursor^^^ was more efficiently secreted (Fig. 2). The se- 
cretion efficiency of the insulin aspart precursor^^^ is illus- 
trated by the ratio of secreted precursor increased from 10 to 
50% (Fig. 2). 

Folding Stability of Insulin Aspart Precursors — To evaluate 
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Fig. 2. Secretion of the insulin aspart precursor with either 
the AAK C-peptide or the EWK C-peptide expressed in S. cerevi- 
siae analyzed by pulse-chase analysis. The S. cerevisiae cells were 
pulsed for 2.5 min with [^^S] cysteine and subsequently chased for 0, 5, 
10, or 30 min with an excess of unlabeled cysteine. After the pulse-chase 
experiment, proteins were separated by SDS-PAGE and quantified by 
Phosphorlmager analysis (Molecular D3Tiamics). The secretion kinetic 
of the insulin aspart precursor with the AAK mini-C-peptide is shown 
as a solid line and with the EWK mini-C-peptide as a dashed line. 
Insulin aspart precursor localized intracellularly (IC) is indicated by T 
and ■ and extracellularly (EX) by ▲ and The data summarize 
labeled precursor both as fusion protein and as mature precursor (only 
the precursor was labeled), and the data were normalized for compar- 
ison using endogenous protein as internal standard. 




Fig. 3. Unfolding curves for native and mutant insulin precur- 
sors expressed as the relative change in CD at 224 nm as a 
function of GrdnHCl concentration. insulin aspart precursoH^^; 
+, insulin aspart precursor" (minus indicates a direct connection be- 
tween B29 and Al); insulin aspart precursor^^^; T, insulin aspart 
precursor" with the additional amino acid substitutions Glu^-^°, Asp^^®, 
His^®, and Glu^^^; insulin aspart precursor^^^; and ♦, insulin 
aspgirt precursor^^^ with the additional amino acid substitutions 
Glu^-^°, Asp^^®, His^®, and Glu^^^. The insulin aspart precursors, except 
insulin aspart precursor^^, featured a N-terminal extension with the 
sequence E(EA)3PK that improved Kex2 endoprotease processing dur- 
ing secretion (see "Experimental Procedures" for further details). 



whether the improved secretion of insulin aspart precursors 
reflected improved folding, the precursors were purified and 
the in vitro folding stability determined. The in vitro folding 
stability of the precursors was assessed by titrations with the 
protein denaturant GdnHCl and detection by far-UV CD spec- 
troscopy. The far-UV CD as a function of increasing concentra- 
tions of GdnHCl monitors the loss of secondary structure that 
accompanies protein unfolding. As shown in Fig. 3, the intro- 
duction of EWK and LWK mini-C-peptides strongly enhances 
the folding stability of the insulin aspart precursor. These data 
suggest that the in vitro folding stability is positively correlated 
with expression yield. To further investigate this correlation, 
additional amino acid substitutions, Glu^^^, His^^, Glu^^^, 
were introduced to stabilize the a-helices of the insulin aspart 
precursor with and without the EWK mini-C-peptide (see Ref. 
17 for design of helix capping). The helix-capping substitutions 
enhanced the folding stability and concomitantly increased the 
expression yield of the precursor 4.2-fold (Table I). Moreover, 
the combination of the EWK mini-C-peptide and these amino 
acid substitutions further improved the folding stability of the 
precursor, and the expression yield was increased 7.6-fold rel- 
ative to the insulin aspart precursor^^ (Table I). Indeed, the 
combination of the EWK mini-peptide and the a-helix capping 
mutations results in an insulin species that remains essentially 
folded at 7 M GdnHCl. This corresponds to about a 200-fold 
shift in the forms in favor of the folded state in comparison with 
the insulin aspart precursor^^^^ species. Thus, the folding sta- 
bility is a principal parameter in determining eukaryotic secre- 
tion efficiency. The previously described natural and sjnithetic 
C-peptides of insulin appear to offer little structural support or 
interaction with the remainder of the molecule. In contrast, the 
C-peptides described here have significant impact on the over- 
all structure of the molecule by both intramolecular interaction 
and a direct contribution to the structural elements of the 
precursor. 

The activities of human insulin and insulin aspart are sim- 
ilar when determined by both metabolic potency (lipogenesis) 
and by receptor affinity (21). However, the single-chain insulin 
precursor has --^0.1% activity of the two-chain mature mole- 
cules (22). This has been interpreted that flexibility in the C 
terminus of the B-chain and a free N terminus of the A-chain 
are required for activity (22, 23). The activity (lipogenesis) of 
the described single chain insulin aspart precursor featuring 
tryptophan in the C-peptide is further decreased 10-fold to 
—0.01% of human insulin. The low activity of this precursor is 



consistent with decreased flexibility in the C terminus of the 
B-chain and the N terminus of the A-chain because this region 
has been anchored by the tryptophan to the hydrophobic core of 
the molecule. Importantly, in vitro conversion of the insulin 
aspart precursors to the two-chain insulin aspart molecule 
restores full activity to the mature two-chain molecule (21). 

Tertiary Structure of the Insulin Aspart Precursor — To assess 
the structural mechanism behind the concomitantly improved 
secretion and folding stability, the structure of the insulin 
aspart precursor was determined. The structure of the insulin 
aspart precursor^^^ was determined by NMR spectroscopy 
based on two-dimensional proton NMR spectra at pH 8.0 and 
1.0 mM. At these conditions the precursor does not dimerize or 
otherwise self-associate to any significant degree. Structure 
elucidation of the insulin aspart precursor^^^ from NMR data 
agrees with other structures determined by similar methods 
(see supplementary table). The two-dimensional NOESY spec- 
tra of the insulin aspart pre curs or^^^ are significantly differ- 
ent from similar spectra of the insulin aspart precursor^"^ 
(data not shown). 

The structure of the insulin aspart precursor^^^ (Fig. 4) 
shares secondary structural elements with other insulin struc- 
tures (24) including the extended structure from B1-B8, the 
a-helix B9-B19, a turn B20-B23, and a half-jS-sheet structure 
from B24 to B27. However, the EWK mini-C-peptide and espe- 
cially Trp*^^ induce a number of distinct structural differences 
compared with all previously described insulin structures 
(Figs. 1 and 4). Importantly, Trp^^ is partly inserted into the 
binding site that is described above, which binds m-cresol/ 
phenol in the insulin aspart precursor^^^^ (Fig. 1). The trypto- 
phan side chain moiety packs against Ile^^ and Val^^ as its 
helical neighbors to Leu^^^, VaP^^, and Leu^^'^ in the central 
B-chain helix and in the C-terminal part of the B-chain to 
rjiyj,B26 ^j^jj Asp^^^. Furthermore, Tyr^^® together with Asp^^^ 
have been rearranged to open the m-cresol binding site to Trp^^ 
(Fig. 4). 

Furthermore, the insertion of the tryptophan into the hydro- 
phobic core results in a re-orientation of the half-j8-sheet struc- 
ture from B24 to B27 and extension of the Al a-helix of the 
A-chain (Fig. 4). The two-dimensional NOESY spectra of the 
insulin aspart precursor^^'^ show an extensive number of 
NOEs indicative of an ce-helix from C2-A8. However, these 
NOEs are almost absent in the spectra of insulin aspart pre- 
cursor^^. Furthermore, the chemical shift values of a-protons 
assigned for residues of the insulin aspart precursor^^^ in this 
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Fig. 4. Schematic representation of insulin aspart precur- 
g^j^EWK structure as determined by NMR displayed using Mol- 
script (26). The second A-chain helix is semi-transparent to aUow 
visuahzation of the interaction of Trp*^^ to its structural neighbors 
shown in stick representation. Besides NOEs to the sequentially neigh- 
boring residues, tryptophan*^'^ has several NOEs to residues Ile^, 
VaI^^ LeuBll, VaP^^^ LeuBl^ Tyr^^^ and Asp^^s rj^^ residues with 
NOEs to Trp^^ form a hydrophobic cleft almost equivalent to the m- 
cresol binding pocket described in the insulin aspart precursoH^^ 
structure by x-ray crystallography (11) (see text for details). 

region are on average shifted 0.35 up-field (lower ppm value) 
supporting that the residues of the EWK mini-C-peptide and 
the five first residues in the A-chain of the insulin aspart 
precursor^^^ reside in an a-helical structure. Interestingly, 
the x-ray-derived structure of the insulin aspart precursor^^ 
also indicates an a-helix from C2-A8 (11). We conclude that the 
enhanced folding stability originates from the extended 
A-chain helix, which is anchored to the molecule by hydropho- 
bic packing of the tryptophan side chain. 

Moreover, the half-j8-sheet structure from B24 to B27 is 
moved further away from the center of the molecule compared 
with other insulin structures. The residues B28, B29, and CI 
form a small ill defined coil structure, which is the starting 
point of the well defined a-helix from C2 to A8. This structural 
re-organization of the C-terminal part of the B-chain accommo- 
dates both the extension of the Al a-helix and the insertion of 
the tryptophan residue side chain into the hydrophobic core of 
the molecule (Fig. 4). 

A novel aspect of the present studies is that structural im- 
provement of molecular stability concomitantly enhances the 
efficiency whereby insulin traffics through the eukaryotic se- 
cretory pathway. Moreover, the enhanced folding stability was 
engineered by a hydrophobic interaction based on a specific 
phenol binding site in the core of the molecule. Furthermore, 
this intramolecular interaction introduces unique structural 
elements into the insulin molecule. Thus, the expressed mole- 
cule is not a mutant version of fast-acting insulin but a rational 
structurally adapted precursor that is readily converted into 
the native, two-chain structurally identical fast-acting insulin. 
Intriguingly, it was possible to further enhance the secretion 
efficiency in a synergistic manner by combing this novel stabi- 
lizing intramolecular interaction with more classical stabiliza- 



tion of the secondary structure elements. Interestingly, it has 
been shown that mutants of bovine pancreatic trypsin inhibitor 
with impaired folding properties also were expressed ineffi- 
ciently in yeast (9, 10). This was interpreted that the thermo- 
dynamic stability can determine the efficiency of escape from 
ER, presumably because of degradation of inefficiently folded 
molecules. However, it has also been shown that insulin ex- 
pressed in a sec 18-1 mutant yeast strain, where the molecule 
cannot reach the Golgi complex, is not degraded, strongly in- 
dicating that insulin is not subjected to ER- associated degra- 
dation in yeast (25). The presented data support a model where 
the structural and folding properties of the insulin molecule are 
key factors for efficient biosynthesis and secretion. 
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